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The "Power of One Qubit" refers to a computational model that has access to 
only one pure bit of quantum information, along with n qubits in the totally mixed 
state. This model, though not as powerful as a pure-state quantum computer, is ca- 
pable of performing some computational tasks exponentially faster than any known 
classical algorithm. One such task is to estimate with fixed accuracy the normal- 
ized trace of a unitary operator that can be implemented efficiently in a quantum 
circuit. We show that circuits of this type generally lead to entangled states, and 
we investigate the amount of entanglement possible in such circuits, as measured by 
the multiplicative negativity. We show that the multiplicative negativity is bounded 
by a constant, independent of n, for all bipartite divisions of the n + 1 qubits, and 
so becomes, when n is large, a vanishingly small fraction of the maximum possible 
multiplicative negativity for roughly equal divisions. This suggests that the global 
nature of entanglement is a more important resource for quantum computation than 
the magnitude of the entanglement. 



I. INTRODUCTION 



Fully controllable and scalable quantum computers are likely many years from realization. 
This motivates study and development of somewhat less ambitious quantum information 
processors, denned as devices that fail to satisfy one or more of DiVincenzo's five criteria 
for a quantum computer ^J. An example of such a quantum information processor is a 
mixed-state quantum system, which fails to pass DiVincenzo's second requirement, that the 
system be prepared in a simple initial state. 

The prime example of a mixed-state quantum information processor is provided by liquid- 
state NMR experiments in quantum information processing [2j]. Current NMR experiments, 
which operate with initial states that are highly mixed thermal states, use a technique called 
pseudo-pure-state synthesis to process the initial thermal state and thereby to simulate pure- 
state quantum information processing. This technique suffers from an exponential loss of 
signal strength as the number of qubits per molecule increases and thus is not scalable. 
There is a different technique for processing the initial thermal state, called algorithmic 
cooling which pumps entropy from a subset of qubits into the remaining qubits, leaving 
the special subset in a pure state and the remaining qubits maximally mixed. Algorithmic 
cooling provides an in-principle method for making liquid-state NMR — or any qubit system 



*Electronic address: animesh@unm.edu 

^Electronic address: sflammia@unm.edu 

* Electronic address: caves@info.phys.unm.edu 



2 



that begins in a thermal state — scalable, in essence by providing an efficient algorithmic 
method for cooling a subset of the initially thermal qubits to a pure state, thereby satisfying 
DiVincenzo's second criterion. 

Knill and Laflamme proposed a related mixed-state computational model, which they 
called DQC1, in which there is just one initial pure qubit, along with n qubits in the 
maximally mixed state. Although provably less powerful than a pure-state quantum com- 
puter 0, DQC1 can perform some computational tasks efficiently for which there are no 
known polynomial time classical algorithms. In particular, a DQC1 quantum circuit can be 
used to evaluate, with fixed accuracy independent of n, the normalized trace, tr(£7 n )/2 n , of 
any n-qubit unitary operator U n that can be implemented efficiently in terms of quantum 
gates In Sec.|n]we consider briefly whether there might be efficient classical algorithms 

for estimating the normalized trace, and we conclude that this is unlikely. The efficient quan- 
tum algorithm for estimating the normalized trace provides an exponential speedup over the 
best known classical algorithm for simulations of some quantum processes 0, IH . Knill and 
Laflamme referred to the power of this mixed-state computational model as the "power of 
one qubit." 

Study of the power of one qubit is motivated partly by NMR experiments, but our primary 
motivation in this paper is to investigate the role of entanglement in quantum computation, 
using DQC1 as a theoretical test bed for the investigation. For pure-state quantum com- 
puters, Jozsa and Linden [^| have shown that exponential speedup over a classical computer 
requires that entanglement not be restricted to blocks of qubits of fixed size as problem size 
increases. Entanglement that increases with problem size is thus a necessary prerequisite 
for the exponential speedup achieved by a pure-state quantum computer. On the other 
hand, the Gottesman-Knill theorem demonstrates that global entanglement is far from 
sufficient for exponential speedup. While this means that the role of entanglement is not 
entirely understood for pure-state quantum computers, far less is known about the role of en- 
tanglement in mixed-state quantum computers. When applied to mixed-state computation, 
the Jozsa-Linden proof does not show that entanglement is a requirement for exponential 
speedup. Indeed, it has not previously been shown that there is any entanglement in the 
DQC1 circuits that provide an exponential speedup over classical algorithms. 

The purpose of this paper is to investigate the existence of and amount of entanglement 
in the DQC1 circuit that is used to estimate the normalized trace. The DQC1 model consists 
of a special qubit (qubit 0) in the initial state |0)(0| = ^(Ii + Z), where Z is a Pauli operator, 
along with n other qubits in the completely mixed state, / n /2 n , which we call the unpolarized 
qubits. The circuit consists of a Hadamard gate on the special qubit followed by a controlled 
unitary on the remaining qubits 
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After these operations, the state of the n + 1 qubits becomes 

Pn + l = ^(|0)(0|®/ n +|l)(l|®4 + |0)(l|®f/t + | 1)(0 |^^ = ^^4 ^P) j (L2) 

where N = 2 n . The information about the normalized trace of U n is encoded in the expec- 
tation values of the Pauli operators X and Y of the special qubit, i.e., (X) = Re[tr(£7 ri )]/2 n 
and (Y) = - Im[tr ([/„)] /2 n . 
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To read out the desired information, say, about the real part of the normalized trace, 
one runs the circuit repeatedly, each time measuring X on the special qubit at the output. 
The measurement results are drawn from a distribution whose mean is the real part of 
the normalized trace and whose variance is bounded above by 1. After L runs, one can 
estimate the real part of the normalized trace with an accuracy e ~ 1/ \[L. Thus, to achieve 
accuracy e requires that the circuit be run L ~ 1 /e 2 times. More precisely, what we mean by 
estimating with fixed accuracy is the following: let P e be the probability that the estimate 
is farther from the true value than e ; then the required number of runs is L ~ ln(l/P e )/e 2 . 
That the number of runs required to achieve a fixed accuracy does not scale with number of 
qubits and scales logarithmically with the error probability is what is meant by saying that 
the DQC1 circuit provides an efficient method for estimating the normalized trace. 

Throughout much of our analysis, we use a generalization of the DQC1 circuit, in which 
the initial pure state of the special qubit is replaced by the mixed state + aZ), which 
has polarization a, 
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We generally assume that a > 0, except where we explicitly note otherwise. After the circuit 
is run, the system state becomes 
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The effect of subunity polarization is to reduce the expectation values of (X) and (Y) by a 
factor of a, thereby making it more difficult to estimate the normalized trace. Specifically, 
the number of runs required to estimate the normalized trace becomes L ~ ln(l/P e )/a 2 e 2 . 
Reduced polarization introduces an additional overhead, but as long as the special qubit 
has nonzero polarization, the model still provides an efficient estimation of the normalized 
trace. What we are dealing with is really the "power of even the tiniest fraction of a qubit." 

For n + 1 qubits, all states contained in a ball of radius r n+ i centered at the completely 
mixed state are separable 11, 0] (distance is measured by the Hilbert-Schmidt norm). 
Unitary evolution leaves the distance from the completely mixed state fixed, so at all times 
during the circuit (jl.3J) . the system state is a fixed distance ^/tr(pi — I n+ i/2N) 2 = a2~( n+1 ^ 2 
from the completely mixed state. This suggests that with a small enough, there might be 
an exponential speedup with demonstrably separable states. This suggestion doesn't pan 
out, however, because the radius of the separable ball decreases exponentially faster than 
2-(n+i)/2_ fj^g Iqq^ known lower bound on r n is 2 x Q~ n / 2 ^3|; for the system state to be 
contained in a ball given by this lower bound, we need a < 2 x 3~( n+1 )/ 2 . The exponential 
decrease of a means that an exponentially increasing number of runs is required to estimate 
the normalized trace with fixed accuracy. More to the point, the possibility that the actual 
radius of the separable ball might decrease slowly enough to avoid an exponential number 
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of runs is ruled out by the existence of a family of n-qubit entangled states found by Diir et 
al. H, which establishes an upper bound on r n that goes as 2 x 2 n for large n, implying 



that a < 2 x 2~( n+1 " 2 if the system state is to be in the ball given by the upper bound. 
These considerations do not demonstrate the impossibility of an exponential speedup using 
separable states, but they do rule out the possibility of finding such a speedup within the 
maximal separable ball about the completely mixed state. 

We are thus motivated to look for entanglement in states of the form (jl.5j) . for at least 
some unitary operators U n . Initial efforts in this direction are not encouraging. It is clear 
from the start that the marginal state of the n unpolarized qubits remains completely 
mixed, so these qubits are not entangled among themselves. Moreover, in the state (jl.5|) . 
as was shown in Ref. 0], the special qubit is unentangled with the n unpolarized qubits, 
no matter what U n is used. To see this, one plugs the eigendecomposition of the unitary, 
U n = J2j e ^ j \ e j)( e j\i m t° the expression for p n+ i(a). This gives a separable decomposition 

Pn+i{u) = ^^(K-Xoil + \ b j)( b j\) ® \ e j)( e j\ , (1-6) 

3 

where \aj) = cos6>|0) + e 1 ^ sin0|l) and \bj) = sin#|0) + e 1 ^ cos#|l), with sin26 l = a. 

No entanglement of the special qubit with the rest and no entanglement among the rest- 
where then are we to find any entanglement? We look for entanglement relative to other 
divisions of the qubits into two parts. In such bipartite divisions the special qubit is grouped 
with a subset of the unpolarized qubits. To detect en tang lement between the two parts, 
we use the Peres- Horodecki partial transpose criterion [la, [ly] , and we quantify whatever 
entanglement we find using a closely related entanglement monotone which we call the mul- 
tiplicative negativity ^?|- The Peres-Horodecki criterion and the multiplicative negativity 
do not reveal all entanglement — they can miss what is called bound entanglement — but we 
are nonetheless able to demonstrate the existence of entanglement in states of the form (|1.2|) 
and (|1.5|) . For convenience, we generally refer to the multiplicative negativity simply as the 
negativity. The reader should note, as we discuss in Sec. IHIl that the term "negativity" was 
originally applied to an entanglement measure that is closely related to, but different from 
the multiplicative negativity. 

The amount of entanglement depends, of course, on the unitary operator U n and on the 
bipartite division. We present three results in this regard. First, in Sec. I1V1 we construct a 
family of unitaries U n such that for a > 1/2, p n+1 (a) is entangled for all bipartite divisions 
that put the first and last unpolarized qubits in different parts, and we show that for all such 
divisions, the negativity is (2a + 3)/4 for a > 1/2 (5/4 for a = 1), independent of n. Second, 
in Sec. |V[ we present numerical evidence that the state p n+ \ of Eq. (jl.2|) is entangled for 
typical unitaries, i.e., those created by random quantum circuits. For n + 1 = 5, . . . , 10, we 
find average negativities between 1.155 and just above 1.16 for the splitting that puts [n/2\ 
of the unpolarized qubits with qubit 0. Third, in Sec. I VII we show that for all unitaries and 
all bipartite divisions of the n + 1 qubits, the negativity of p n+ \ (a) is bounded above by the 
constant yl + a 2 (\/2 ~ 1.414 for a = 1), independent of n. Thus, when n is large, the 
negativity achievable by the DQC1 circuit (jl.l|) becomes a vanishingly small fraction of the 
maximum neg ativity, ~ 2 n / 2 , for roughly equal bipartite divisions. 

The layout of the paper is as follows. In Sec. |H] we examine the classical problem of 
estimating the normalized trace of a unitary. In Sec. IIHI we review pertinent properties of 
the negativity before applying it to obtain our three key results in Sees. IIVHVII We conclude 
in Sec. IVIII and prove a brief Lemma in an Appendix. Throughout we use A to stand for the 
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partial transpose of an operator A relative to a particular bipartite tensor-product structure, 
and we rely on context to make clear which bipartite division we are using at any particular 
point in the paper. 

II. CLASSICAL EVALUATION OF THE TRACE 

In this section we outline briefly a classical method for evaluating the trace of a unitary 
operator that can be implemented efficiently in terms of quantum gates, and we indicate 
why this appears to be a problem that is exponentially hard in the number of qubits. 

The trace of a unitary matrix U n = U is the sum over the diagonal matrix elements of U: 

tr(l7)=X>|l7|a>. (2.1) 

a 

Here a is a bit string that specifies a computational-basis state of the n qubits. By factoring 
U into a product of elementary gates from a universal set and inserting a resolution of the 
identity between all the gates, we can write tr(Z7) as a sum over the amplitudes of Feynman 
paths. A difficulty with this approach is that the sum must be restricted to paths that begin 
and end in the same state. We can circumvent this difficulty by preceding and succeeding 
U with a Hadamard gate on all the qubits. This does not change the trace, but does allow 
us to write it as 

tr(U) = (a|^1b)(b|[/|c)(c|^|a) = 1 ^(-l) a -( b+c )(b|f/|c) . (2.2) 

a,b,c a,b,c 

Now if we insert a resolution of the identity between the elementary gates, we get tr(U) 
written as an unrestricted sum over Feynman-path amplitudes, with an extra phase that 
depends on the initial and final states. 

Following Dawson et al. ^If, we consider two universal gate sets: (i) the Hadamard 
gate H, the 7r/4 gate T, and the controlled-NOT gate and (ii) H and the Toffoli gate. 
With either of these gate sets, most of the Feynman paths have zero amplitude. Dawson et 
al. [la] introduced a convenient method, which we describe briefly now, for including only 
those paths with nonzero amplitude. One associates with each wire in the quantum circuit a 
classical bit value corresponding to a computational basis state. The effect of an elementary 
gate is to change, deterministically or stochastically, the bit values at its input and to 
introduce a multiplicative amplitude. The two-qubit controlled-NOT gate changes the input 
control bit x and target bit y deterministically to output values x and y®x, while introducing 
only unit amplitudes. Similarly, the three-qubit Toffoli gates changes the input control bits 
x and y and target bit z deterministically to x, y, and z © xy, while introducing only unit 
amplitudes. The T gate leaves the input bit value x unchanged and introduces a phase 
e ix?r/4_ rj^g Hadamard gate changes the input bit value x stochastically to an output value 
y and introduces an amplitude (— l) xy / \/2. 

The classical bit values trace out the allowed Feynman paths, and the product of the 
amplitudes introduced at the gates gives the overall amplitude of the path. In our application 
of evaluating the trace ([2.2)1 . a path is specified by n input bit values (which are identical to 
the output bit values), n random bit values introduced by the initial Hadamard gates, and 
h random bit values introduced at the h Hadamard gates required for the implementation 
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of U. This gives a total of 2n + h bits to specify a path and thus 2 2n+h allowed paths. We 
let x denote collectively the 2n + h path bits. 

If we apply the gate rules to a Hadamard-Toffoli circuit, the only gate amplitudes we 
have to worry about are the ±l/\/2 amplitudes introduced at the Hadamard gates. There 
being no complex amplitudes, the trace cannot be complex. Indeed, for this reason, achieving 
universality with the if- Toffoli gate set requires the use of a simple encoding, and we assume 
for the purposes of our discussion that this encoding has already been taken into account. 
With all this in mind, we can write the trace (|2.2j) as a sum over the allowed paths, 

X 

Here ^(x) is a polynomial over Z2, specifically, the mod-2 sum of the products of input and 
output bit values at each of the Hadamard gates. The downside is that a string of Toffoli 
gates followed by a Hadamard can lead to a polynomial that is high order in the bit values. 
As pointed out by Dawson et al. [IH, we can deal with this problem partially by putting a 
pair of Hadamards on the target qubit after each Toffoli gate, thus replacing the quadratic 
term in the output target bit with two new random variables and preventing the quadratic 
term from iterating to higher order terms in subsequent Toffoli gates. In doing so, we are 
left with a cubic term in ip(x) from the amplitude of the first Hadamard. The upshot is that 
we can always make t/*(x) a cubic polynomial. 
Notice now that we can rewrite the trace as 



tr(C/) 



2n+h/2 



number of x such\ / number of x such 
that ifj(x) =0 J ~ \ that ifj(x) = 1 



(2.4) 



thus reducing the problem of evaluating the trace exactly to counting the number of zeroes 
of the cubic polynomial ^(x). This is a standard problem from computational algebraic 
geometry, and it is known that counting the number of zeroes of a general cubic polynomial 
over any finite field is #P complete 19]. It is possible that the polynomials that arise 
from quantum circuits have some special structure that can be exploited to give an efficient 
algorithm for counting the number of zeroes, but in the absence of such structure, there is no 
efficient classical algorithm for computing the trace exactly unless the classical complexity 
hierarchy collapses and all problems in #P are efficiently solvable on a classical computer. 

Of course, it is not our goal to compute the trace exactly, since the quantum circuit only 
provides an efficient method for estimating the normalized trace to fixed accuracy. This 
suggests that we should estimate the normalized trace by sampling the amplitudes of the 
allowed Feynman paths. The normalized trace, 

x 

which lies between —1 and +1, can be regarded as the average of 2 2n+h quantities whose 
magnitude, 2 h ^ 2 , is exponentially large in the number of Hadamard gates. To estimate the 
average with fixed accuracy requires a number of samples that goes as 2 h , implying that 
this is not an efficient method for estimating the normalized trace. The reason the method 
is not efficient is pure quantum mechanics, i.e., that the trace is a sum of amplitudes, not 
probabilities. 
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If we apply the gate rules to a Hadamard-T-controlled-NOT circuit, the bit value on each 
wire in the circuit is a mod-2 sum of appropriate bit values in x, but now we have to worry 
about the amplitudes introduced by the Hadamard and T gates. The trace (|2.2|) can be 
written as 

tr{U) -^E^^H- 1 )^ • ( 2 - 6 ) 

X 

Here </>(x) is a polynomial over Z 2 , obtained as the mod-2 sum of the products of input and 
output bit values at each of the Hadamard gates. Since the output value is a fresh binary 
variable and the input value is a mod-2 sum of bit values in x, 0(x) is a purely quadratic 
polynomial over Z 2 . The function x( x ) is a niod-8 sum of the input bit values to all of the 
T gates. Since these input bit values are mod-2 sums of bit values in x, x( x ) is linear in bit 
values, but with an unfortunate mixture of mod-2 and mod-8 addition. We can get rid of 
this mixture by preceding each T gate with a pair of Hadamards, thus making the input to 
the every T gate a fresh binary variable. With this choice, x( x ) becomes a mod-8 sum of 
appropriate bit values from x. 

We can rewrite the sum ()2.6|) in the following way: 

number of x such that\ / number of x such that\ 
X(x) = j and 0(x) = 0J \x(*) = 3 and 0(x) = 1J ' 

(2.7) 

Thus the problem now reduces to finding simultaneous (binary) solutions to the purely 
quadratic Z 2 polynomial 0(x) and the purely linear Zg polynomial x( x )- O ne has to be 
careful here to note that we are only interested in binary solutions, so we are not solving 
x(x) = j over all values in Z§. The number of solutions of a purely quadratic polynomial 
over Z 2 can be obtained trivially jl^ , but the constraint over Z 8 means that one must count 
the number of solutions over a mixture of a field and a ring. The complexity class for this 
problem is not known, but given the equivalence to counting the number of solutions of 
a cubic polynomial over Z 2 , it seems unlikely that there is an efficient classical algorithm. 
Moreover, an attempt to estimate the normalized trace by sampling allowed paths obviously 
suffers from the problem already identified above. 



III. PROPERTIES OF NEGATIVITY 

In this section we briefly review properties of negativity as an entanglement measure, fo- 
cusing on those properties that we need in the subsequent analysis (for a thorough discussion 
of negativity, see Ref. [l7| ) . 

Let A be an operator in the joint Hilbert space of two systems, system 1 of dimension d\ 
and system 2 of dimension d 2 . The partial transpose of A with respect to an orthonormal 
basis of system 2 is defined by taking the transpose of the matrix elements of A with respect 
to the system-2 indices. A partial transpose can also be defined with respect to any basis 
of system 1. Partial transposition preserves the trace, and it commutes with taking the 
adjoint. 

The operator that results from partial transposition depends on which basis is used to 
define the transpose, but these different partial transposes are related by unitary transfor- 
mations on the transposed system and thus have the same eigenvalues and singular values. 
Moreover, partial transposition on one of the systems is related to partial transposition on 
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the other by an overall transposition, which also preserves eigenvalues and singular values. 
Despite the nonuniqueness of the partial transpose, we can talk meaningfully about its in- 
variant properties, such as its eigenvalues and singular values. Similar considerations show 
that the eigenvalues and singular values are invariant under local unitary transformations. 

The singular values of an operator O are the eigenvalues of v 0^0 = \0\ (or, equivalently, 
of yOCft). Any operator has a polar decomposition O — T\0\, where T is a unitary operator. 
Writing \0\ — W'SW, where W is the unitary that diagonalizes \0\ and S is the diagonal 
matrix of singular values, we see that any operator can be written as O = VSW, where 
V = TW^ and W are unitary operators. 

We denote a partial transpose of A generically by A. We write the eigenvalues of A 
as Xj{A) and denote the singular values by Sj(A). If A is Hermitian, so is A, and the 
singular values of A, i.e., the eigenvalues of \A\, are the magnitudes of the eigenvalues, i.e, 
s j (A) = \\ j (A)\. 

If a joint density operator p of systems 1 and 2 is separable, its partial transpose p is a 
positive operator. This gives the Peres-Horodecki entanglement criterion (Til if p has a 



negative eigenvalue, then p is entangled (the converse is not generally true). The magnitude 
of the sum of the negative eigenvalues of the partial transpose, denoted by 

\j(p)<0 

is a measure of the amount of entanglement. Partial transposition preserves the trace, so 
tr(p) = 1, from which we get 

1 + 2A/-(p) = £ \Xj{p)\ = tr\p\ = M(p) , (3.2) 

3 

where M.{p) is a closely related entanglement measure. The quantity W(p) was originally 
called the negativity [17[; we can distinguish the two measures by referring to -M(p) as the 
multiplicative negativity, a name that emphasizes one of its key properties and advantages 
over M{p). In this paper, however, we use the multiplicative negativity exclusively and so 
refer to it simply as "the negativity". 

The negativity A4(p) equals one for separable states, and it is an entanglement mono- 
tone [17|, meaning that (i) it is a convex function of density operators and (ii) it does not 
increase under local operations and classical communication. The negativity has the prop- 
erty of being multiplicative in the sense that the M. value for a state that is a product of 
states for many pairs of systems is the product of the M. values for each of the pairs. By 
the same token, log.M(p), called the log-negativity, is additive, but the logarithm destroys 
convexity so the log- negativity is not an entanglement monotone ^t}- For another point of 
view on the monotonicity of the log- negativity, see Ref . . 

The minimum value of the negativity is one, but we need to know the maximum value 
to calibrate our results. Convexity guarantees that the maximum value is attained on pure 
states. We can find the maximum 21] by considering the Schmidt decomposition of a joint 
pure state of systems 1 and 2, 

d 

W) = Y,T/ih\3>3), (3-3) 

3=1 
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where d = min(di, d 2 ). Taking the partial transpose of p relative to the Schmidt basis of 
system 2 gives 



d 

P=Y1 \/Pb'i k)(k,j\ , (3.4) 



j,k=i 

with eigenvectors and eigenvalues 



\j,j) , eigenvalue p h 
k ) ± 3)) ' eigenvalue ± y /p~p^, j < k. 



(3.5) 



This gives a negativity 



d / d s 2 

M(^) = 1 + 2 ^ = ^ = ( XI v 7 /^ J • 



(3.6) 



The concavity of the square root implies £7. \/Wi — Va, with equality if and only if ^ = 1/d 
for all j, i.e., \ip) is maximally entangled. We end up with 

l<M(p)<d. (3.7) 

The negativity is the sum of the singular values of p. For states of the form we are 
interested in, given by Eq. (|1.5jl . the negativity is determined by the singular values of the 
partial transpose of the unitary operator U n . To see this, consider any bipartite division of 
the qubits. Performing the partial transpose on the part that does not include the special 
qubit, we have 

Pn + i(a) = ^( I p f ^ , (3.8) 
2N \aU n I n J 

where U n is the partial transpose of U n relative to the chosen bipartite division. Notice that 
if we make our division between the special qubit and all the rest, then U n = t/J is a unitary 
operator, and p n+ i(a) is the quantum state corresponding to using U% in the circuit (|1.3|k 
this shows that for this division, the negativity is 1, consistent with our earlier conclusion 
that the special qubit is not entangled with the other qubits. For a general division, we 
know there are unitaries V and W such that U n = VSW, where S is the diagonal matrix of 
singular values Sj(U n ). This allows us to write 

pn+l{a) = {o v)m{as i n ) [o Vi) ' (3 ' 9) 

showing that p n+ \{a) is a unitary transformation away from the matrix in the middle and 
thus has the same eigenvalues. The block structure of the middle matrix makes it easy 
to find these eigenvalues, which are given by [1 ± aSj(U n )]/2N. This allows us to put the 
negativity in the form 

1 N 1 N 

M(p n+l {a)) = — J2 I 1 + a*i(#n)| + |1 - aSj(U n )\ = —J2 msoc (\ a \ s iftn), l) , (3-10) 

J'=l 3=1 
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which is valid for both positive and negative values of a. An immediate consequence of 
Eq. (j3.10|) is that M.(p n +\{ai)) = M. (p„+i(— «)) , as one would expect. Since p n+ i(a) 
is a mixture of p n+ i(+l) = p n+ i and p„ + i(— 1), convexity tells us immediately that 
< .M(/V|_i), i.e., that a mixed input for the special qubit cannot increase 
the negativity over that for a pure input. More generally, we have that the negativity 
cannot decrease at any point as a increases from to 1. 



IV. ENTANGLEMENT IN THE DQC1 CIRCUIT 



In this section, we construct a family of unitaries U n that produce global entanglement in 
the DQC1 circuit (jl.lj) . For a — 1, the negativity produced by this family is equal to 5/4, 
independent of n, for all bipartite divisions that put the first and last unpolarized qubits in 
different parts. We conjecture that this is the maximum negativity that can be achieved in 
a circuit of the form (jl.lj) . 

Before the measurement, the output state of the circuit (jl.3j) is given by Eq. (jl.5j) . To 
construct the unitaries U n , we first introduce a two-qubit unitary matrix 



D, B 1 



(4.1) 



where A 1; Bi, C%, and D\ are single-qubit (2 x 2) matrices that must satisfy A\Ai -\- D\Di = 
B\Bi + C\C\ = l\ and A\C\ + D\B\ = to ensure that 4 2 is unitary. The n-qubit unitary 
U n is then defined by 



U„ 



A n _ 2 ® C x 

4-2 ® 5i 



|0> <0| ® 4_2 ® A + ® 4-2 ® B a 

+ |0> < 1 1 ®X n _ 2 ®Ci + |1)(0| ®X n _ 2 ( 



£>1 



(4.2) 



Here we use Xi, Yi, and Z\ to denote single-qubit Pauli operators. A subscript k on the 
identity operator or a Pauli operator denotes a tensor product in which that operator acts 
on each of k qubits. If we adopt the convention that X = I = 1, then U n reduces to 4 2 
when n — 2. It is easy to design a quantum circuit that realizes U n . The structure of the 
circuit is illustrated by the case of 4 4 : 



— 1 


> 4 
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1st qubit | | [ 77~ | J | ( 4 - 3 ) 
4th qubit 
2nd qubit 
3rd qubit 

In general, the two-qubit unitary {7 2 , acting on the first and last qubits, is bracketed by 
controlled-NOT gates from the first qubit, acting as control, to each of the other qubits, 
except the last, as targets. 

Because 1% and X\ are invariant under transposition, it is clear from the form of U n that 
in the state (jl.5|) . all qubits, except 0, 1, and n, are invariant under transposition. We can 
use this fact to find the negativity for all bipartite divisions. First consider any bipartite 
division that puts qubits 1 and n in the same part. There are two possibilities. If the special 



11 



qubit is in the same part as qubits 1 and n, then partial transposition on the other part 
leaves p n+ i(a) unchanged, so the negativity is 1. If the special qubit is not in the same 
part as 1 and n, then partial transposition on the part that includes 1 and n is the same as 
partial transposition of all the unpolarized qubits, a case we already know to have negativity 
equal to 1. We conclude that any bipartite division that puts 1 and n in the same part has 
negativity equal to 1. 

Turn now to bipartite divisions that put qubits 1 and n in different parts. There are two 
cases to consider: (i) the special qubit is in the same part as qubit 1, and (ii) the special 
qubit is in the same part as qubit n. In case (i), partial transposition of the part that 
contains qubit n gives 

Pn+1 {a) = 2N [aU n I n ) Wlth Un = U- 2 ® DJ 4- 2 ® Bf ) ■ ^ 
In case (ii), partial transposition of the part that contains qubit 1 gives 

" ( \ 1 f In aU£\ ., , fr ( In-2 ® A 1 X n _ 2 ® DA 

The basic structure of p n+ i(a) is the same in both cases. Without changing the spectrum, 
we can reorder the rows and columns to block diagonalize p n+1 {a) so that there are N/4 
blocks, each of which has the form 

where pz(ot) is the appropriate partial transpose of the three-qubit output state. Thus the 
spectrum of p n+ i(a) is the same as the spectrum of /^(a), except that each eigenvalue is 
reduced by a factor of 4/N. In calculating the negativity, since each eigenvalue is (iV/4)-fold 
degenerate, the reduction factor of 4/N is cancelled by a degeneracy factor of N/4, leaving 
us with the fundamental result of our construction, 

M(p n+1 (a)) =M(p 3 (a)) . (4.7) 

This applies to both cases of bipartite splittings that we are considering, showing that all 
divisions have the same negativity as the corresponding n = 2 construction. 
We now specialize to a particular choice of U 2 given by 

*-(»!), *-(;«), *-(»;), - A-gg). m 

For this choice, the two cases of bipartite division lead to the same partial transpose. The 
spectrum of 

f h aA 1 
h ad aB 1 
olA x aDi h 
\ad aB x h J 



& (a) 



(4.9) 



is 



Spec(p 3 (a)) = \{l + 2a, 1, 1, 1, 1, 1, 1 - 2a) , (4.10) 



12 



giving a negativity equal to 1 for a < 1/2 and a negativity 

M(p n+1 (a)) =M(p 3 {a)) = ^(2a + 3) for a > 1/2 . (4.11) 

This result shows definitively that the circuit can produce entanglement, at least for a > 1/2. 
We stress that the negativity achieved by this family of unitaries is independent of n > 2. 

For a = 1, the negativity achieved by this family reduces to 5/4. For large n, this amount 
of negativity is a vanishingly small fraction of the maximum possible negativity, ~ 2 n//2 , for 
roughly equal divisions of the qubits. This raises the question whether it is possible for other 
unitaries to achieve larger negativities. A first idea might be to find two-qubit unitaries U2 
that yield a higher negativity M.(pz) = M.(p n+ i) when plugged into the construction of this 
section, but the bounds we find in Sec. |V3 dispose of this notion, since they show that 5/4 
is the maximum negativity that can be achieved for n = 2. Another approach would be to 
generalize the construction of this section in a way that is obvious from the circuit ()4.3|) . i.e., 
by starting with a fc-qubit unitary in place of the two-qubit unitary of Eq. ()4.3|) . Numerical 
investigation of the case k = 3 has not turned up negativities larger than 5/4. We conjecture 
that 5/4 is the maximum negativity that can be achieved by states of the form (jl.2j) . Though 
we have not been able to prove this conjecture, we show in the next section that typical 
unitaries for n + 1 < 10 achieve negativities less than 5/4 and in the following section that 
the negativity is rigorously bounded by y/2. 

We stress that we are not suggesting that the construction of this section, with £7 2 given 
by Eq. (|4.8|) . achieves the maximum negativity for all values of a, for that would mean that 
we believed that the negativity cannot exceed 1 for a < 1/2, which we do not. Although 
we have not found entanglement for a < 1/2, we suspect there are states with negativity 
greater than 1 as long as a is large enough that p n+ i(a) lies outside the separable ball 
around the maximally mixed state, i.e., a > 2 <n+1 - ) / 2 r n+ i. The bound of Sec. IVII only says 
that M. (p n+ i(a)) < yl + a 2 , thus allowing negativities greater than 1 for all values of a 
except a = 0. Moreover, since the negativity does not detect bound entanglement, there 
could be entangled states that have a negativity equal to 1. 



V. THE AVERAGE NEGATIVITY OF A RANDOM UNITARY 

Having constructed a family of unitaries that yields a DQC1 state with negativity 5/4, 
a natural question to ask is, 'What is the negativity of a typical state produced by the 
circuit (|1.1|) ?" To address this question, we choose the unitary operator in the circuit (|1.1|) 
at random and calculate the negativity. Of course, one must first define what it means 
for a unitary to be "typical" or "chosen at random". The natural measure for defining 
this is the Haar measure, which is the unique left-invariant measure for the group U(iV) 
22]. The resulting ensemble of unitaries is known as the Circular Unitary Ensemble, or 



CUE, and it is parameterized by the Hurwitz decomposition [23] . Although this is an exact 
parameterization, implementing it requires computational resources that grow exponentially 
in the size of the unitary [24j. To circumvent this, a pseudo-random distribution that requires 
resources growing polynomially in the size of the unitary was formulated and investigated 



in Ref. 24]. This is the distribution from which we draw our random unitaries, and we 



summarize the procedure for completeness. 
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We first define a random SU(2) unitary as 

, . / e**cos0 e ix sm6\ , . 

where 9 is chosen uniformly between and n/2, and and \ are chosen uniformly between 
and 2tt. A random unitary applied to each of the n qubits is then 

n 

R = (gWi,&,Xi), (5-2) 
i=i 

where a separate random number is generated for each variable at each value of i. Now 
define a mixing operator M in terms of nearest-neighbor Z ® Z couplings as 

M = exp ^ Yl Zij) ® Z(j+1) j • ( 5 - 3 ) 

The pseudo-random unitary is then given by 

R i MR i _ 1 ---MR 2 MR 1 , (5.4) 

where j is a positive integer that depends on n, and each R& is chosen randomly as described 
above. For a given n, the larger j is, the more accurately the pseudo-random unitary 
distribution resembles the actual CUE. From the results in Ref. [24], j = 40 gives excellent 
agreement with the CUE for unitary operators on at least up to 10 qubits, so this is what 
we use in our calculations. 

Due to boundary effects, not all bipartite splittings that put k unpolarized qubits in 
one part are equivalent. Nevertheless, we consider only bipartite divisions that split the 
qubits along horizontal lines placed at various points in the circuit of Eq. (jl.lj) . We refer 
to the division that groups the last k qubits together as the (n + 1 — k, k) splitting. For 
a = 1, we calculate the average negativity and standard deviation of a pseudo-random 
state p n+ i for two different bipartite splittings, (n, 1) and ([n/2\ + 1, [n/2]). These results 
are plotted in Fig. ^ For n + 1 = 5, . . . , 10, the average negativity for the roughly equal 
splitting lies between 1.135 and just above 1.16. The standard deviation appears to converge 
exponentially to zero, as in Ref. j25[, a behavior that is typical of asymptotically equivalent 
matrices. In addition, for 9 + 1 qubits, we calculate the average negativity and standard 
deviation for all nontrivial (k ^ n) bipartite splittings (n + 1 — k, k), and the results are 
shown in Fig. |21 



VI. BOUNDS ON THE NEGATIVITY 

In this section, we return to allowing the special qubit in the circuit (jl.3|) to have initial 
polarization a. Since the value of n is either clear from context or fixed, we reduce the 
notational clutter by denoting the state p n+ i(a) of Eq. (jl.5|) as p a . 

Given a particular bipartite division, the partial transpose of p a with respect to the part 
that does not include the special qubit is 



(6.1) 
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FIG. 1: Left: Average negativity of the state p n +i of Eq. (jl.2|) (a = 1) for a randomly chosen 
unitary U n for two different bipartite splittings, (n, 1) and ([n/2j + 1, \n/2\). The (n, 1) splitting 
appears to reach an upper bound quickly, whereas the other splitting is still rising slowly at 10 
qubits. Right: Semi-log plot of the standard deviation in the negativity of the randomly chosen 
state p n +\. The fit curves show that the standard deviation is decaying exponentially, so that for 
large numbers of qubits, almost all unitaries give the same negativity. 



where 

6 s {If) ■ < 6 - 2 ' 

Using the binomial theorem, we can expand tr(p^) in terms of tr(C k ): 

tr(^)=(^jE(l)« fctr (^)- ( 6 - 3 ) 

When k is odd, C is block off-diagonal, so its trace vanishes. When k is even, we have 

tT{C k ) = 2ti( K (U n Ui) k/2 ) ■ (6.4) 

When k = 2, this simplifies to tr((7 2 ) = 2tr{U n Ul) = 2tr([7 n [/t) = 2N. The crucial step here 
follows immediately from the property tr(AB) = tr(AB), which we prove as a Lemma in the 
Appendix. Note that in general trf^^ . . . A{) ^ tv(AiA2 . . . A{) if I > 2, so we cannot give 
a similar general calculation of tr(p*) for even s > 4, since it involves terms of this form. 

Using Eq. (J6.3J) . we can now obtain three independent constraint equations on the eigen- 
values \j = Xj(pa) of the partial transpose p a : 

2N 1 

£ A* = tr(#) = — ^[(1 + af + (1 - a) s ] , a = 1, 2, 3. (6.5) 
i=i 

Since the negativity is given by 
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Negativity v. Bipartite Splitting Number for a 9+1 Qubit DQC1 State 
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FIG. 2: Average negativity of the state pio of Eq. (|1.2|) (a = 1) for a randomly chosen unitary 
Ug as a function of bipartite splitting number k for bipartite splittings (10 — k, k). The error bars 
give the standard deviations. The function attains a maximum when the bipartite split is made 
between half of the qubits on which the unitary acts. 

we can find an upper bound on the negativity by maximizing X^l-^jl subject to the con- 
straints (J6.5j) . If we consider only the s = 1,2 constraints, we obtain a nontrivial upper 
bound on the negativity with little effort. We find that adding the constraint s = 3 adds 
nothing asymptotically for large N, but for small N yields a tighter bound than we get from 
the s — 1,2 constraints, although this comes at the cost of considerably more effort. We 
emphasize that these bounds apply to all bipartite divisions and to all unitaries U n . Notice 
that we have no reason to expect these bounds to be saturated, since the traces of higher 
powers of p a impose additional constraints that we are ignoring. The one exception is the 
case of three qubits, where the s = 1, 2, 3 constraints are a complete set, and indeed, in this 
case, the s = 1, 2, 3 bound is 5/4, which is saturated by the unitary found in Sec. II VI 

The remainder of this section is devoted to calculating the s — 1,2 and s — 1, 2, 3 upper 
bounds. A graphical summary of our results is for a = 1 presented in Fig. El 

A. The s = 1,2 Bound 

We can use Lagrange multipliers to reduce the problem to maximizing a function of one 
variable, but first we must deal with the absolute value in Eq. To do so, we assume 

that t of the eigenvalues are negative and the 2N — t others are nonnegative, where t becomes 
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Negativity v. Number of Qubits 
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FIG. 3: Plot of the bounds on the negativity of states of the form (jl,2j) . i.e., for a pure-state 
input in the zeroth register (a = 1). The uppermost plot is the simple analytic bound = \/2, 
obtained using the s = 1, 2 constraint equations; the next largest plot is the numerically constructed 
s = 1,2,3 bound. One can see that the s = 1,2,3 bound asymptotes to the s = 1,2 bound. As 
noted in the text, these bounds are independent of the unitary U n and the bipartite division. The 
flat line shows the negativity 5/4 for the state constructed in Sec. IIVI currently the state of the 
form Q1.2|) with the largest demonstrated negativity; notice that for n + 1 = 3, this state attains 
the s = 1,2,3 bound. The lowest two sets of data points display the expected negativities for a 
randomly chosen unitary using the bipartite splittings (n, 1) and (L^/2j +1, |~n/2~|), which were 
also plotted in Fig. ^ 



a parameter that must now be included in the maximization. We want to maximize 

t 2N 

1=1 j=t+l 

subject to the constraints 

2N 2N 1 + 2 

^A, = l and ]TA2 = ^-. (6.8) 

k=l k=l 

The notation we adopt here for the indices is that i labels negative eigenvalues and j labels 
nonnegative eigenvalues, while k can label either. This serves to remind us of the sign of an 
eigenvalue just by looking at its index. 

Introducing Lagrange multipliers \x and v, the function we want to maximize is 

t 2N I 2N \ / 2N 2 \ 

i=l j=t+l \k=l J \k=l J 
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Differentiating with respect to Aj and then Xj, we find 

- 1 +fi + 2u\i = , (6.10) 
+ l + /i + 2z/A j = 0. (6.11) 

We immediately see that in the maximal solution, all the negative eigenvalues are equal, 
and all the nonnegative eigenvalues are equal. We can now reformulate the problem in the 
following way. If we call the two eigenvalues A_ and A+, our new problem is to maximize 

Mi i2 = 5^|A fc | = -tA_ + (2N-t)X+ , (6.12) 

k 

subject to the constraints 

tA_ + (2N-t)X+ = 1 , (6.13) 

tXl + (2N-t)Xl = 1±^. (6.14) 

We can now do the problem by solving the constraints for A_ and A + in terms of t, plugging 
these results into .Mi 2, and then maximizing over t. 

Before continuing, we note two things. First, t cannot be 2N, for if it were, then all the 
eigenvalues would be negative, making it impossible to satisfy Eq. (|6.13j) . Second, unless 
a — 0, t cannot be 0, for if it were, then all the eigenvalues would be equal to 1/2N by 
Eq. Ijfi.lUjl . a situation Eq. ()6.14j) says can occur only if a = 0. Since we are not really 
interested in the case a = 0, for which p a is always the maximally mixed state, we assume 
a > and < t < 2N in what follows. 

Solving Eqs. ()6.13|) and ()6.14|) and plugging the solutions into Eq. ()6.12|) . we get the two 
solutions 

Mi, 2 = ^ • (6.15) 

We choose the positive branch, since it contains the maximum. Maximizing with respect to 
t treated as a continuous variable, we obtain the upper bound, 

Mi t2 = Vl + a 2 a = X \[2 ~ 1.414 , (6.16) 
which occurs when the degeneracy parameter is given by 

t = Nil - - 1 ] 0.292A^ . (6.17) 

The numbers on the right are for the case a — 1, corresponding to the special qubit starting 
in a pure state. Notice that the upper bound ()6.16|) allows a negativity greater than 1 for 
all a except a = 0. 

Since we did not yet enforce the condition that t be a positive integer, the bound (|6.16|) 
can be made tighter for specific values of iV and a by calculating t and checking which of 
the two nearest integers yields a larger Asymptotically, however, the ratio t/N can 

approach any real number, so this bound for continuous t is the same as the bound for 
integer t in the limit iV — ► 00. 
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B. The s = 1,2,3 Bound 

To deal with this case, we again make the assumption that t of the eigenvalues are negative 
and 2N — t are nonnegative and thus write 

t 2N 

■Mi,2,3 = -^Ai+ A i > ( 6 - 18 ) 

i=i j=t+i 

as before. In addition to the constraints (|fi.8j) . we now have a third constraint 

k=l 

We specialize to the case a = 1 for the remainder of this subsection, because it is our main 
interest, and the algebra for the general case becomes difficult. 

Introducing three Lagrange multipliers, we can write the function we want to maximize 

as 



t 2N / 2N \ / 2N \ / 2N \ 

i=l j=t+l \fc=l / \Jfc=l / \Jfc=l / 



Differentiating with respect to \ and then Xj gives 



(6.20) 



- 1 + n + 2u\i + 3£A, 2 = , (6.21) 
+ 2v\j + 3^A| = . (6.22) 

These equations being quadratic, we see that there are at most two distinct negative eigen- 
values and at most two distinct nonnegative eigenvalues. Since the sum of the two solutions 
of either of these equations is — 2z//3£, however, we can immediately conclude either that one 
of the potentially nonnegative solutions is negative or that one of the potentially negative 
solutions is positive. Hence, we find that at least one of the four putative eigenvalues has 
the wrong sign, implying that there are at most three distinct eigenvalues, though we don't 
know whether one or two of them are negative. 

Labelling the three eigenvalues by A, B, and C, we can reduce the problem to solving 
the three constraint equations, 

uA + vB + wC = 1 , 
uA 2 + vB 2 + wC 2 = 1/N , (6.23) 
uA 3 + vB 3 + wC 3 = 1/N 2 , 

for A, B, and C and then maximizing A^i,2,3 over the degeneracy parameters u, v, and w, 
which are nonnegative positive integers satisfying the further constraint 

u + v + w = 2N . (6.24) 



We do not associate any particular sign with A, B, and C; the signs are determined by the 
solution of the equations. 
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One might hope that the symmetry of Eqs. ()6.23|) would allow for a simple analytic 
solution, but this appears not to be the case. In solving the three equations, one is inevitably 
led to a sixth-order polynomial in one of the variables, with the coefficients given as functions 
of u, v, and w. Rather than try to solve this equation, which appears intractable, we elected 
to do a brute force optimization for any given value of 2N by solving Eqs. ()6.23j) for each 
possible value of u, v, and w. Picking the solution that has the largest negativity then yields 
the global maximum. We did this for each N up to 2N = 78. The values of u, v, and w 
that maximize the negativity are always 



u 



1, w = 2N-l-u, (6.25) 



where [x] denotes the integer nearest to x. The unique eigenvalue corresponding to v = 1 
is the largest positive eigenvalue, w is the degeneracy of another positive eigenvalue, and 
u is the degeneracy of the negative eigenvalue. Notice that the degeneracy of the negative 
eigenvalue is exactly what was found in the s = 1, 2 case. Using the results (|6.25|) as a 
guide, we did a further numerical calculation of the maximum for larger values of N, by 
considering only the area around the degeneracy values given by Eq. (|6.25|) . While this is 
not a certifiable global maximum, the perturbation expansion described below matches so 
well that the two are indistinguishable if they are plotted together for n + 1 > 7. This gives 
us confidence that the numerically determined upper bound .M 1,2,3, which we plot in Fig. |31 
is indeed a global maximum for all N. 

We have used the numerical work to help formulate a perturbation expansion that gives 
the first correction to the N — > 00 behavior of the s — 1, 2, 3 bound. Defining x = 1/N, we 
rewrite the constraint equations (|6.23|) as 

aA + bB + cC = x , 
aA 2 + bB 2 + cC 2 = x 2 , (6.26) 
aA 3 + bB 3 + cC 3 = x 3 , 

where a = u/N, b = v/N, and c — w/N. We also have the constraint 

a + 6 + c = 2. (6.27) 

As x is the variable that is asymptotically small, we seek an expansion in terms of it. 

Our numerical work tells us that there are two positive eigenvalues, one of which is larger 
and nondegenerate. In formulating our perturbation expansion, we let B and C be the 
positive eigenvalues, with B being the larger one, having degeneracy v = b\ > 1. We do 
not assume that 61 is 1, as the numerics show, but rather let the equations force us to that 
conclusion. With this assumption, the form of the constraints (|6.26|) shows that the variables 
have the following expansions to first order beyond the N — > 00 form: 

a = do + dix 1 / 3 , 

b = hx , (6.28) 

and 



C = Cq + c\x l l 3 



A = A x + A lX i/3 , 

B = B lX 2/3 , (6.29) 
C = C x + dx A/3 . 
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We see that we are actually expanding in the quantity y = x 1//3 . In terms of these variables, 
the negativity is given by 

-aA + bB + cC 

M lj2 ,3 = 

x 

= -a A + c C + {-a Q A x - a x A + c Q C x + c^x 1 ^ + 0(x 2/s ) , (6.30) 

which we now endeavor to maximize. 

Substituting Eqs. ()6.28|) and ()6.29|) into the constraints (|6.26|) and ()6.27|) and equating 
terms with equal exponents of x, we obtain, to zero order, 

a o + c o — 2 , 

a A + c C = 1 , (6.31) 
a Al + c Co = 1 . 

Solving for a , c , and C in terms of A and substituting the results into the zero-order 
piece of A^i,2,3 gives 

1 - AA + 2Al . 

Mi » = i-U +2 £ ■ (6 - 32) 

Maximizing Eq. ()6.32|) gives Aq = 1/2 and, hence, A = —l/y/2, since A is the negative 
eigenvalue. This leads to a = 1 — l/y/2, Co = 1 + l/y/2, and Co = l/y/2, and the resulting 
N — > oo upper bound is M. i j2j 3 = y/2, as expected. 

If we carry this process out to first order beyond the iV — > oo behavior, we obtain, after 
some algebraic manipulation, -Mi 5 2,3 = y/2 — 6^ 3 x 1 / 3 /2 7 / 6 + 0(x 2 / 3 ). Maximizing this simply 
means making b\ as small as possible, i.e., choosing b\ = 1, whence we obtain the following 
asymptotic expression for the s = 1,2,3 upper bound: 

M ^ = ^-w^ + °{i^) ■ (6 - 33) 

This shows that the upper bound of y/2 is approached monotonically from below in the 
asymptotic regime. In addition, the procedure verifies that in the maximum solution, the 
largest positive eigenvalue is nondegenerate. For the case of qubits we have iV = 2", implying 
that the approach to the N — > oo bound is exponentially fast. 



VII. CONCLUSION 

The mixed-state quantum circuit ()1.3j) provides an efficient method for estimating the 
normalized trace of a unitary operator, a task that is thought to be exponentially hard on 
a classical computer. If one believes that global entanglement is the essential resource for 
the exponential speedup achieved by quantum computation, then the question begging to 
be answered is whether there is any entanglement in the circuit's output state (|1.5|) . The 
purpose of this paper was to investigate this question. 

A notable feature of the circuit (J1.3)) is that it provides an efficient method for estimating 
the normalized trace no matter how small the initial polarization a of the special qubit in 
the zeroth register, as long as that polarization is not zero. Since all the other qubits are 
initially completely unpolarized, we are led to characterize the computational power of this 
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circuit as the "power of even the tiniest fraction of a qubit." We provide preliminary results 
regarding the entanglement that can be achieved for a < 1. Our results are consistent 
with, but certainly do not demonstrate the conclusion that separable states cannot provide 
an exponential speedup and that entanglement is possible no matter how small a is. The 
question of entanglement for subunity polarization of the special qubit deserves further 
investigation. 

Our key conclusions concern the case where the special qubit is initially pure (a = 1). 
We find that the circuit (jl.lj) typically does produce global entanglement, but the amount 
of this entanglement is quite small. Using multiplicative negativity to measure the amount 
of entanglement, we show that as the number of qubits becomes large, the multiplicative 
negativity in the state ()1.2|) is a vanishingly small fraction of the maximum possible mul- 
tiplicative negativity for roughly equal splittings of the qubits. This hints that the key to 
computational speedup might be the global character of the entanglement, rather than the 
amount of the entanglement. In the spirit of the pioneering contribution of Wyler |26|, what 
happier motto can we find for this state of affairs than Multum ex Parvo, or A Lot out of A 
Little. 
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APPENDIX A: PROOF OF THE LEMMA 



Lemma: tr(AB) = tr(AB). 



Proof: Define two operators A and B by 

A = a ij)k i\ij){kl\ , B = ^ b mn<pg \mn)(pq\ . (Al) 

i,j,k,l m,n,p,q 

Taking the partial transpose with respect to the second subsystem, we find 

A = ajj,ki\H)(kj\ , B = ^ b mnm \mq)(pn\ . (A2) 

i,j,k,l m,n,p,q 

Calculating the quantities of interest, we find that they are indeed equal. 

^ij,kl"mn,pq 

(pn\il)(kj\mq) = ^ 0-ij,kfikl,ij i (A3) 

i,j,k,l, i,3Al 
m, n, p, q 

tr (AB) = 

Q J ij,kl"mri,pq 

(pq\ij) (kl\mn) = ^ 

(A4) 

i,j,k,l, i,j,k,l 
m,n,p,q 
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